OJT Project Report and Demonstration
Under the guidance of
Dr. Dinesh Helwade
August 3, 2025
This On-the-Job Training (OJT) project provided an invaluable opportunity to apply theoretical knowledge of natural language processing, deep learning, and data engineering into a real-world application. The primary objective was to build a pipeline for automatic transcription and summarization of educational and conference video content, aimed at making such content more accessible and searchable.
I would like to express my sincere gratitude to Dr. Dinesh Helwade for his guidance and encouragement throughout the project. It was under his initiative that this project was designed with a vision to open-source transcription and summarization tools for the benefit of educational institutions and research communities.
This project marks a critical step in combining machine learning technology with accessibility goals in education.
Video content has become the most preferred medium for disseminating information — especially in academia, conferences, and online education. However, extracting insights or reviewing content from long-form videos remains time-consuming and inefficient.
Pradnyaa InfoVision, headquartered in Pune, India, is a specialized analytics and consulting firm with over four years of experience delivering data-driven solutions. The company operates across two core domains: Retail Analytics and Biostatistics, offering deep domain expertise and tailored consulting to global clients.
The Retail Analytics team at Pradnyaa InfoVision excels in demand forecasting, leveraging both standard statistical models and cutting-edge machine learning algorithms. Their capabilities span across:
Regular Price Optimization
Markdown Strategy & Optimization
Inventory Optimization
One of the firm’s flagship projects involved partnering with a major U.S. retailer to design and execute a comprehensive price test across the entire U.S. region—demonstrating the company’s global reach and strategic insight.
In the life sciences space, Pradnyaa InfoVision plays a key role in the analysis and processing of all three phases of clinical trials. The Biostatistics division supports pharmaceutical and healthcare companies by providing end-to-end statistical solutions that comply with global regulatory standards.
Beyond analytics, the company also provides strategic consulting for offshore office setup and management in India. This includes team recruitment, operational oversight, and seamless integration with client business processes, enabling clients to establish a strong and scalable presence in India.
| Model | Accuracy | Offline | Multilingual | Ease of Use | Cost |
|---|---|---|---|---|---|
| Whisper | ✅✅✅ | ✅ | ✅✅✅ | ✅✅ | Free |
| Wav2Vec 2.0 | ✅✅ | ✅ | ⚠️ (mostly English) | ✅ | Free |
| Google API | ✅✅✅ | ❌ | ✅✅✅ | ✅✅✅ | Paid |
| DeepSpeech | ✅ | ✅ | ❌ | ✅✅ | Free |
| Kaldi | ✅✅ | ✅ | ✅ (with effort) | ⚠️ Complex | Free |
| Vosk | ✅✅ | ✅ | ✅✅ | ✅✅✅ | Free |
Scan the QR code to view the presentation:
OJT Project – 2025